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Mechanistic multi-stage models are used to analyze lung-cancer mortality after Plutonium ex¬ 
posure in the Mayak-workers cohort, with follow-up until 2008. Besides the established two-stage 
model with clonal expansion, models with three mutation stages as well as a model with two dis¬ 
tinct pathways to cancer are studied. The results suggest that three-stage models offer an improved 
description of the data. The best-fitting models point to a mechanism where radiation increases the 
rate of clonal expansion. This is interpreted in terms of changes in cell-cycle control mediated by 
bystander signaling or repopulation following cell killing. No statistical evidence for a two-pathway 
model is found. To elucidate the implications of the different models for radiation risk, several expo¬ 
sure scenarios are studied. Models with a radiation effect at an early stage show a delayed response 
and a pronounced drop-off with older ages at exposure. Moreover, the dose-response relationship is 
strongly nonlinear for all three-stage models, revealing a marked increase above a critical dose. 


I. INTRODUCTION 

Cancer is a genetic disease. In the widely held theory of 
somatic evolution [l|, a cell’s path toward the malignant 
state is portrayed as a series of mutations or epigenetic 
events, lending it successive selective advantages. These 
advantages, as summarized in the “hallmarks of cancer” 
[^, essentially amount to an increasingly uncontrolled 
proliferation. 

Those essential features—mutations accompanied by 
proliferation—have long been identified as key ingredi¬ 
ents in modeling carcinogenesis. Beginning with the sem¬ 
inal multi-step models by Armitage/Doll and Nordling 
g i, this eventually led to the stochastic two-stage 
model with clonal expansion due to Moolgavkar, Ven- 
zon, and Knudson which has by now become an 

established tool to understand and predict cancer risk 

[®. 

What fundamentally distinguishes such mechanistic 
models from conventional epidemiological ones is that 
they do not directly model the endpoint—say, the cancer 
mortality rate—but rather the process thought to lead 
to it, parametrized via mutation and proliferation rates. 
This may prove useful especially when the mortality rate 
is highly convoluted by an exposure to carcinogens such 
as ionizing radiation, as is the case in this study. 

Such a mechanistic approach can be readily generalized 
so as to build in known biological effects, such as multi¬ 
ple genetic pathways or a more realistic number 

of stages (Til . [T3 |. Indeed, for colorectal cancer, where 
the understandin g of the cellular mechanisms is compar¬ 
atively advanced (l^ Il4l|. a number of extended models 
have been put forward to account for the role of genomic 
instability [lOL[l5l - [l7l | , the rather large number of prema- 
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lignant stages [T^[T^. as well as the intricate dynamics 
during progression |l9l |. 

By contrast, far less is known regarding other cancer 
types. For lung cancer, mechanistic modeling studies are 
abundant but have focused almost exclusively on the two- 
stage model. These indicate that for the two main risk 
factors, smoking (20l - l^ and a-particle radiation (most 
relevant being Radon decay products) (23 - 13^ . the best 
description is afforded by an enhancement of the prolif¬ 
eration rate of premalignant cells. However, for radia¬ 
tion, this conclusion has been disputed [s^ because it 
lacks a conclusive biological mechanism, in contrast to 
the accepted mutagenic effect of radiation. This debate 
has been further sparked by the single analysis to date 
going beyond the two-stage model Comparing fits 
to the Colorado-miners data using the two-stage model 
with those from a subclass of three-stage models, the au¬ 
thors suggested that a proliferation effect were confined 
to the two-stage model, whereas a better fit quality was 
achieved within a three-stage framework with a muta¬ 
tional radiation action. 

The objective of this paper is to perform a com¬ 
parative analysis of lung-cancer risk associated with a- 
radiation using different multi-stage models - specifically 
two- and three-stage models as well as a two-path model 
for (radiation-induced) genomic instability. Our goal is 
to identify the mechanisms of radiation action suggested 
by those models, as well as to lay out to what extent 
their predicted radiation risks are consistent. To this 
end, we apply these models to the Mayak-workers cohort 
[ 3 ^. These workers, employed at the formerly So¬ 
viet Plutonium-production plant, have been exposed to 
substantial doses of ^^®Pu via inhalation and exhibit a 
large number of lung-cancer deaths, 895 in total [s^- A 
notable feature of Plutonium exposure is its strong pro¬ 
traction, which might facilitate the assessment of risk on 
the long time scales relevant to indoor Radon. Further¬ 
more, information on the strongest risk factor, smoking, 
is available for most workers. We will show that certain 
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three-stage models give an improved description of the 
data, and we elucidate how these lead to predictions for 
the risk that are qualitatively different from both two- 
stage mechanistic and standard descriptive models. 


II. MATERIALS AND METHODS 
A. Mayak-workers cohort 

The Mayak-workers cohort comprises nuclear workers 
at the Mayak Plutonium-production facility at Ozyorsk, 
Russia [^. The current follow-up includes all years 
1948-2008 and comprises 25,757 members, cf. Ref. js^l 
for a comprehensive overview. 

Many of the workers have been exposed to Plutonium- 
239, predominantly through inhalation. These inter¬ 
nal doses have been assessed via urine measurements 
combined with biokinetic modeling for about 40% of 
workers in the plants at risk [s^. Exposure to ex¬ 
ternal gamma radiation has been recorded via film- 
badge dosimeters js^, and average annual dose rates 
are available for all workers. Furthermore, for most 
workers, information exists on smoking status (non- 
/eversmoker) as well as on alcohol consumption (teeto- 
taler/light/medium/heavy/chronic), see below. Owing 
to pronounced smoking and Plutonium-inhalation pat¬ 
terns, the dominant cancer-mortality endpoint is lung 
cancer (dehned here as ICD-9 code 162), with a total 
of 895 mortality cases. 


1. Cohort definition 

To obtain a sufficiently homogeneous data set 
amenable to mechanistic modeling, several selection cri¬ 
teria have been applied, similar to previous studies 
[UlUl. Our reduced (sub)cohort excludes females, since 
these make up less than 25% of the whole cohort and ex¬ 
hibit very different mortality rates. Moreover, full infor¬ 
mation is required on smoking/alcohol status and annual 
internal dose(rate)s - i.e., ^^®Pu doses must be measured 
or assumed to vanish (for workers outside the radiochem¬ 
ical and Pu plants).!^ 

Finally, the follow-up period is restricted as follows. If 
a Pu measurement has been performed at time tpu, then 
the entry date is set to tpu -|- 2a. This is done to avoid 
selection bias, specifically healthy-survivor effects (due 
to extended follow-up periods for persons surviving until 
tpu) and diagnostic bias (in case the measurement has 
been caused by imminent health problems). To ensure 
complete follow-up, the exit date is cut off at the end of 
2008 or, in the case of migrants, 2003. 

The reduced cohort includes 8,604 persons and 388 
lung-cancer deaths. 


2. Risk factors 

Let us briefly highlight some aspects of the major risk 
factors (see Ref. for details). The main interest here 
is in internal radiation, with measured nonzero doses 
available for 3,667 persons. Due to slow degradation in 
the lungs and the long half-life of ^^®Pu, exposures are 
highly protracted: First exposure peaks around age 20, 
and typically continues until the end of follow-up. Cumu¬ 
lative lung doses are well described by a log-normal dis¬ 
tribution. Among those measured, it is peaked at 4mGy, 
much below the mean dose D = 0.12Gy. Restricted 
to lung-cancer cases, the dose distribution is shifted to 
higher values, with D = 0.44Gy and lower/upper 5% 
quantiles of 7.6mGy and 2.3Gy. 

The overall smoking fraction is about 3/4. Alcohol sta¬ 
tus, although not known to be a risk factor for lung can¬ 
cer, may serve as an indicator for smoking habits because 
the fraction of smokers increases with alcohol consump¬ 
tion. We group heavy/chronic drinkers as one category, 
a = 1, and otherwise set a = 0. 

As with any cancer, age is a crucial intrinsic risk fac¬ 
tor. The ages of cohort members range broadly between 
about 18 and 81 years, as defined by the lower/upper 5% 
quantiles of entry/exit age, with an average of 27 years 
spent in the cohort. By contrast, 90% of lung-cancer 
cases are found only between ages 49 — 78, with a mean 
cancer age of 65 years. 

3. Ethics statement 

The study of the Mayak-workers cohort has been re¬ 
viewed and approved by the Southern Urals Biophysics 
Institute’s Review Board for issues related to privacy and 
personal data protection. 

B. Statistical analysis 

1. Multi-stage models 

Different multi-stage models are applied to the data. 
Their common rationale is to radically reduce the com¬ 
plexity of carcinogenesis to essentially two key processes: 
(i) mutations —or, generally, a series of (epi-)genetic 
transitions from healthy via pre-malignant to malignant 
stem-like cells— and (ii) proliferation (i.e., symmetric di¬ 
vision; cell inactivation or death) of pre-malignant cells 
with a selective advantage (40|.[^ 

Mathematically, this is modeled as continuous-time 
Markov processes for the (stochastic) numbers of cells, 
Xi{t), at the different stages {i = 0, ...,fc). Specifi¬ 
cally, at age t = 0, one starts with Xq = N healthy 
cells, which can make a transition (modeled as a Pois¬ 
son process) with rate fj-o to a first, premalignant stage, 
Ai.[69| These cells can then undergo a birth/death pro¬ 
cess with rates ai/ffi , leading to a net proliferation rate 
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Figure 1: Schematic structure of a fc-stage model (top) 
and the two-path model studied in this paper (below). 
Here, Xj denotes the stochastic number of cells at stage 
j, with arrows indicating transitions at rates /Xj, etc. 
(see text). Cancer is assumed to occur once the first 
malignant cell appears, with latency period tiag ^ 5 
years. 


7 i ~ Qfi — /3i. Further transitions eventually lead to ma¬ 
lignant cells, Xk- The occurrence of the first malignant 
cell is assumed to lead to cancer after an effective lag time 
flag, typically on the order of a few years. A cartoon de¬ 
piction is shown in Fig. [U for k = 2, this corresponds 
to the standard two-stage model with clonal expansion 
(TSCE). 

The mathematical model above can be solved for 
the survival function S(t) and, equivalently, the hazard 
(here: lung-cancer mortality rate), h = using 

the method of characteristics [^. For the two-stage 
model, assuming rate parameters to be piecewise age- 
independent, an exact closed-form solution can be at¬ 
tained [13 ■ In the general case, an extension of this so¬ 
lution is valid approximately if age bins are small enough 
for intermediate-cell numbers to change slowly. 

As an illustration, let us highlight some generic fea¬ 
tures shared by all such multistage models. At earlier 
ages, the hazard is well described by a deterministic 
model, h ~ in terms of the mean numbers of 

cells, Xj = iij-iXj-i + 'jjXj [dlj. This leads to an ini¬ 
tially polynomial growth, h(t) ~ Nfig ■ ■ ■ /{k — 

1 )!, followed by a rapid proliferation-driven phase, h{t) = 
where 7 denotes the maximum growth rate. How¬ 
ever, this deterministic approximation fails to account for 
the effective reduction in available premalignant cells as 
new malignancies arise: Whenever a person reaches the 
cancer endpoint, those cells can no longer lead to fur¬ 
ther cancer cases. At older ages, approximately around 
the mean cancer age, a steady state is reached between 
growth and effective “loss” of premalignant cells [4l| , and 
the hazard levels off to a constant limit, /iqo ~ N 
These borderline cases also illustrate a more general 
point: Not all biological rates can be determined from 
fits to the cancer data alone. Generically, only certain 


parameter combinations are identifiable (d^ . 

In the discussion so far, we have tacitly assumed a lin¬ 
ear series of transitions for simplicity. However, we also 
test a model where mutations may occur along two dif¬ 
ferent pathways, as sketched in Fig. [IJbottom). Such a 
model has been introduced by Little et al. a to account 
for gen omic instability, inspired by models for colon can¬ 
cer [Ij|. The underlying idea is that a second path, ac¬ 
tivated via transition rates aj , corresponds to the loss of 
a gene involved in maintaining genomic integrity. This 
may lead to mutation rates much larger than for the ge- 
nomically stable (upper) path, 3> iij (3. Despite 
the more complex topology, the model equations are con¬ 
structed and solved using exactly the same principles as 
outlined above. 


2. Risk modeling 

In the framework of multi-stage models, the hazard is 
fully determined by the (generally time-dependent) mu¬ 
tation and growth rates. These parameters are essen¬ 
tially assumed to have an age-independent background 
value which may be modified by external risk factors such 
as radiation dose rate, d{t)^ and smoking, s(t), here as¬ 
sumed to start from 18 years on. In practice, both risk 
factors are allowed to independently increase any of the 
rate parameters d; G {/tq, /ri,..., 71 ,...}; from all possi¬ 
ble combinations {'di((i), d;/(s)}, the best-fitting model is 
selected. Depending on which of these rates show a ra¬ 
diation effect, the radiation risk varies in a characteristic 
fashion with age [i^, as well as with modifiers such as 
duration of or age at exposure. 

This is in marked contrast to conventional descriptive 
models employed in radiation epidemiology (^ . which 
we also use to benchmark our results. In a descriptive 
model, the hazard function is modeled directly - rather 
than its underlying mechanism. Here we use the conven¬ 
tional parametrization 

Ii(i) = /ibsi(i)[l+ERR(D,t,...)], (1) 

where the baseline, /ibsi (0 = ^ is factored 

into terms for age-dependent background, = 

Sj=o'"j (efe)’ other risk factors, ipri (birth year, 

smoking and alcohol, etc.). The excess relative risk, 
ERR, is factored into dose-response shape—typically 
a function of cumulative dose, D{t — tiag)—and time- 
dependent modifiers such as attained age or age at expo¬ 
sure, see Appendix. 


3. Fit procedure 

All model parameters are estimated by maximizing 
their likelihood L, which is constructed u sing the indi¬ 
vidual likelihoods of all cohort members [^. Equiva¬ 
lently, we minimize the deviance, T) = — 21 nL > 0 , us¬ 
ing the Minuit function-minimization library For 
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Table I: Synopsis of the best models in this study, along 
with figures of merit for their goodness of fit (see text 
for details). The columns labeled d and s indicate the 
model’s parametric dependence on dose(rate) and 
smoking-related confounders; e.g., 7 = 7 (°)(s) -|- 6^{d) 
for the TSCE model. 


Model 

s 

d 

^ Parameters“ 

Deviance 

AIC 

descriptive ipcf ERR(D) 

9 

4770.3 

4788.3 

TSCE 

7 

7 

10 

4760.7 

4780.7 

3SCE(A) 


71 

11 

4749.9 

4771.9 

3SCE(B) 

72 

71 

11 

4756.7 

4778.7 

3SCE(C) 

71 

72 

11 

4757.7 

4779.7 


“Counting parameters for background, smoking/alcohol, 
birthyear , and Pu; see Appendix. 


model selection, we rely on the likelihood-ratio test for 
nested models so as to retain only parameters signifi¬ 
cant at a 95% confidence level. The same level is also 
adopted throughout this paper for confidence intervals. 
To rank non-nested models, the entropy-based Akaike in¬ 
dex is used jl^, AIC = 'D-\-2n, which effectively penalizes 
overfitting for models with larger number of parameters 
n. 


III. RESULTS 


Table |T] provides an overview of the best-fitting mod¬ 
els. Before explaining in detail their mechanisms and 
the implications for radiation risk, let us anticipate some 
general patterns. All highest-ranked multi-stage mod¬ 
els share a Plutonium-induced enhancement of prolifera¬ 
tion rates. More specifically, the fits suggest that 3-stage 
models with a radiation effect on an early stage of prolif¬ 
eration (models A, B) may yield an improved description 
of the Mayak data, compared with an effect on the penul¬ 
timate stage (C). Although model A exhibits by far the 
lowest AIC, we will present the radiation risks for sev¬ 
eral three-stage models so as to give an impression of the 
range of model predictions, as discussed in the Appendix. 

No evidence is found for a radiation-induced second 
pathway. We note that all dose rates here refer to internal 
Pu exposure; external radiation is not significant in two- 
stage and descriptive models and thus not considered in 
the following. 


A. Two-stage model 

The two-stage model has been applied to previous 
follow-ups of the Mayak workers (^. Id^. We shall there¬ 
fore discuss it here as a benchmark, but also to illustrate 
some mechanisms inherent in any multistage model. 
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Figure 2: Dependence of the proliferation rate, 'j{d), on 
internal (lung) dose rate in the TSCE model. For 
comparison, the dots illustrate the results of a 
categorical fit (with 95%-level errors). 


We find that the effects of both Pu dose rate as well 
as smoking status are by far best described as addi- 
tively enhancing the net proliferation rate, 7 ( 3 , d) = 
7 *'°^(s) -I- d 7 (d).[^ A categorical fit of the dose depen¬ 
dence strongly suggests a saturation at larger dose rates 
(Fig. (2), and we find it best modeled by an exponentially 
leveling function 

Slid) = 7oo (1 - . (2) 

Here r ~ 5/Gy (see Table mil) governs the linear, low- 
dose response, Sj ^ r d, and 700 ~ 0.3/a denotes the 
rate approached as d d* = 7 oo/?'; here d* ~ 0.06Gy/a. 
This is qualitatively in line with previous analyses 
^1 but also several Radon-risk studies [ 2 ^, 

30|, see Discussion. It is worth noting that the data fit 
equally well to a response in terms of the accumulated 
dose D - i.e., 6i{D) in ([5]) - which may relate to the long 
protracted exposures of Mayak workers. 

It should be stressed that, even though the main risk 
factors, smoking and Plutonium, enter the growth rate 
additively, the actual risk will exhibit an interaction be¬ 
tween them. To illustrate this, in Fig. [3a|we display the 
age-dependent hazard, d(t), for a representative exposure 
scenario at a constant dose rate from age ti = 25a to 60a, 
with dose D = 0.2Gy. For a wide age range, coinciding 
with the phase of exponential proliferation, the relative 
risks of radiation and smoking approximately multiply 
(note the log scale). It is only at larger ages (> 60a) that 
the combined risk drops below the multiplicative value. 
Such sub-multiplicity agrees with trends glimpsed in a de¬ 
scriptive analysis [^. Here, it follows naturally because 
the hazard of Pu-exposed smokers levels off much earlier, 
reflecting an earlier onset of cancer (see Methods). 

To better link these findings to those of descriptive 
models (see Appendix), we will from now on consider 
the excess relative risk, ERR = /i//ibsi — Ij defined rela¬ 
tive to the zero-exposure baseline risk. From this angle, 
multiplicity of risk implies an equal ERR for smokers and 
non-smokers - i.e., a ratio ERR*'®“^^/ERR*'®“°^ = 1. Fig¬ 
ure |3b] shows that, under the scenario above, the risk is 
indeed multiplicative until older ages {t < 60a). It then 
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age (a) 

(a) Hazard h{t) of the two-stage model for 
non-/smokers. Shown are values without (baseline) 
and with exposure. (Smoking is assumed to start at 
age 18.) 



(b) Excess-relative-risk (ERR) ratio of smokers and 
non-smokers, shown for different multi-stage models 
(see text). A ratio of 1 indicates multiplicative 
Plutonium-smoking interaction, as in the standard 
descriptive model. The thin-dotted line denotes the 
(non-significant) sub-multiplicative trend suggested 
by an extended descriptive model. 

Figure 3: Risk for a scenario with constant exposure 
between ages 25 and 60, a.t D = 0.2Gy. Notice the lag 
time of 5a. 


becomes sub-multiplicative, and the ERR ratio drops 
markedly below unity after (time-lagged) exposure ends. 

Since most cases are related to smoking, we will now 
concentrate on the risk for smokers. Moreover, to sep¬ 
arate age and dose dependencies, we scale the ERR by 
the accumulated dose, ERR(Z3; — bag), with the 

lag time flag = 5a. Figure 0] displays the age-dependent 
ERR/Z? for the scenario above but at D = 0.5Gy. Eor 
the two-stage model, it reveals a characteristic increase 
during exposure (owing to proliferation), followed by a 
marked drop-off after exposure has ended. The latter is 
similar to the (non-significant) attained-age trend found 
in the descriptive model, cf. Appendix. 

The age dependence just discussed pertains to one spe¬ 
cific scenario. Let us now indicate how the risk is mod¬ 
ified by different exposure patterns. The dependence 
upon age at first exposure, ti, is shown in Fig. [Sa] We 
have chosen a scenario with D = 0.2Gy and a typical du- 



Fignre 4: Age-dependent excess relative risk (ERR/ZZ) 
of different multi-stage models, for smokers with 
constant exposure between ages 25 and 60 {D — 0.5Gy). 

Eor comparison, the non-significant trend in the 
descriptive model (thin-dotted line) is also shown. (All 
error bars are at 95% confidence level.) 


ration At = 20a; the ERR is recorded at ti + At + tiag- 
Glearly, for this two-stage model, the variation is rather 
mild for all but very early exposures (not encountered at 
Mayak) and very late ones. In the former case, virtu¬ 
ally no premalignant cells are available for proliferation. 
At older ages, in tnrn, they are increasingly lost to new 
malignancies; thus the risk is attenuated. In the descrip¬ 
tive model, a weak (non-significant) trend also suggests a 
slight decrease of ERR with older ages at exposure (not 
shown). 

Eignre [13 reveals a characteristic influence of exposure 
duration. At (where D = 0.2Gy, ti = 20a). Eor very 
short exposures. At <g; ZZ/d, 3a, the ERR is strongly 
suppressed: The short duration cannot be compensated 
by an increased dose rate, since the growth rate satu¬ 
rates at dose rates larger than d*. This inverse dose- 
rate effect is thus inherently connected to a saturating 
radiation response as in Eq. (ED- It has been observed 
also in mechanistic Radon studies [U [2^ [s^ , although 
the Mayak data are not powerful enough to support this 
at the descriptive level. At sufficiently long exposures. 
At ^ ZZ/d,, the inverse dose-rate effect disappears and 
eventually gives way to a slight direct effect. This is 
related to the leveling of the hazard upon reaching ma¬ 
lignancy. 

To wrap up this discussion, let us consider the dose- 
response relationship implied by this proliferation-based 
model. Eigure [6] illustrates that, in contrast to the linear 
dose response typically assumed in descriptive modeling 
(ERR(ZZ) = cZZ, here c ~ 4.7/Gy), this model exhibits 
a nonlinear response. Although the quantitative values 
depend on the exposure scenario (here: exposure during 
t = 25 — 60a), some general features apply to any two- 
stage model with a proliferation enhancement similar to 
Eq. (ED: The dose response is characterized by a lin¬ 
ear low-dose regime, ERR ~ r x ZZ for ZZ -C 1/r, and 
leveling at sufficiently large dose(rate)s and/or ages. In- 
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(a) ERR/D for different ages at first exposure, ti (at 
fixed duration At = 20a). 



(b) ERR/D for different durations At (at fixed 
ti = 20a). 

Figure 5: Dependence of ERR/D on different exposure 
patterns, for smokers with D = 0.2Gy. 



0 0.2 0.4 0.6 0.8 1 1.2 1.4 

Internal lung dose (Gy) 

Figure 6 : Dose-response relationship, ERR(D), for 
exposure during ages 25 to 60, recorded at age t = 65. 
For comparison, the linear response ERR = cD is 
plotted. 


termittently, typically an exponential increase is seen, re¬ 
flecting the exponential growth of premalignant cells due 
to proliferation. However, this effect tends to be washed 
out by the leveling at older ages. In a descriptive model 
fit, where essentially an averaging occurs over all expo¬ 
sure histories in the cohort, this pronounced nonlinearity 


will be even harder to resolve. Still, it is noteworthy that 
a TSCE-inspired descriptive model, ERR = — 1 

with f{D) = Coo(l — yields an improved fit 

(but similar AIC), with a linear response cun ^ 3.2/Gy 
and leveling at D* = Coo/cun O.SGy. 


B. Three-stage models 

As mentioned earlier, the highest-ranked 3-stage mod¬ 
els fall into two categories. 

The best two models (A, B; see Table lHll) show a radia¬ 
tion effect on the earlier stage of proliferation, 71 (d), dif¬ 
fering only in their background parameters - specifically 
the smoking response on /xi (A) or 72 (B). This early im¬ 
pact leads to a substantially delayed radiation response 
compared to both the 2-stage model and 3-stage model 
C, as is seen from Fig. U) The ERR/D is initially zero 
and, once it has peaked, tends to drop off more mildly. 
(In some scenarios, the ERR/Z? may even increase when 
exposure stops.) This lag is intuitive, as the insulted 
cells need to pass through an additional mutation stage 
before becoming malignant. In stark contrast, model C— 
with an effect on the penultimate stage—predicts a much 
higher risk, ERR/D ~ r ^ 10/Gy, right after exposure. 
In the scenario depicted in Fig. 01 it further displays an 
almost monotonic drop with attained age, not unlike the 
trend seen in the descriptive model risk. 

However, it must be stated that this specific scenario 
conceals an underlying complexity not present in the 2 - 
stage model. The reason is that the 3-stage model is 
determined by two competing growth rates, 71 and 72 . 
Let us consider models A/B: For a small enough dose rate 

such that 7 ^°^ +rd < 72 °^, the clonal dynamics is governed 
by the largest baseline rate, 72 °^ In other words, below 

the critical dose rate dcrit = ( 72 °^ “ 7i°^)/^~ O.OlGy/a, 
both baseline and excess risk grow with the same expo¬ 
nential rate - hence the ERR would level off with age. It 
is only above that critical dose rate that an exponential 
increase is seen in the ERR. Likewise, for model C, the 
critical point, ( 7 ^°^ ” 72 °^)/''’) marks the dividing line be¬ 
tween exponential increase and leveling with age. Notice 
that the value in Fig. |T]is just at the borderline. 

The dose-rate dependence just described is reflected in 
the dose response (Fig. [ 6 ]). At low doses, a linear increase 
is seen, just as for the two-stage model. However, that 
low-dose response is typically much lower than what we 
saw for two-stage and descriptive models. It is only at 
the critical dose, here dcritAf ~ 0.35Gy, that a rapid 
exponential increase sets in. Gompared to the two-stage 
case, this exponential increase is much sharper owing to 
the higher response coefficients, r ^ (10 — I7)/Gy. As 
before, the response levels off once the corresponding dose 
rates exceed d* ~ (0.03 — 0.05)Gy/a. 

Another consequence of an early-stage radiation effect 
is a notably suppressed risk for older ages at exposure, ti 
iFig. ISal) . To understand this, recall that the radiation 
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effect consists in multiplying the available pool of stage- 
1 cells, Xi(ti), prior to their making the transition to 
stage 2. Since that number grows only very slowly at a 
rate 7 ^°^ < 72 °^, the head start given to already existing 
stage -2 cells, X 2 {ti), becomes overwhelming the later in 
life exposure starts. Thus the ERR is suppressed expo¬ 
nentially as 6“*-'*'2 - 7 i )ii for large ti. By contrast, the 
mechanism for model C is fairly similar to that of the 
2-stage model: Both radiation and baseline risk are gov¬ 
erned by the growth of existing stage-2 cells. This is why 
virtually no dependence on is seen other than a mild 
drop-off for older ages at exposure, due to the onset of 
malignancies. 

Differences with respect to the two-stage model may 
also be observed in the exposure-duration dependence 
(Fig. I5bll . Like the 2-stage models, all 3-stage models 
exhibit an inverse dose-rate effect for durations At < 
D/dt:. As explained previously, this merely hinges on the 
saturation of the proliferation rate for higher dose rates. 
It may be noted that the early-stage models A/B lead to 
a stronger risk suppression at such short durations, due to 
the extra mutational step to be passed. However, a more 
qualitative difference is found for longer durations: Here 
a marked direct effect occurs, the ERR falling off as At 
This may be viewed as a linear dose-rate modification for 
perturbatively small dose rates D/At in the limit of long 
durations. 

Finally, let us comment on the interaction between 
radiation and smoking risks. We saw earlier that the 
two-stage model predicts a largely multiplicative inter¬ 
action or, equivalently, a near-unit ratio of smokers’ 
and non-smokers’ ERR (Fig. I3bll . The situation is less 
clear cut for the three-stage models. Model C, with a 
later-stage radiation effect, is most comparable to the 
2-stage case: Initially, it also exhibits near-multiplicity, 
which reflects that radiation simply leads to multipli¬ 
cation of existing stage-2 cells. However, the ERR ra¬ 
tio then drops very rapidly. This is essentially because 
for smokers, the dose rate is below the critical value, 
( 7 ^°^ — 'y^'^)/r ~ O.OlGy/a, as smoking leads to a very 
large growth rate 71 (s = 1) Ri 0.16a“^. Thus the ERR 
levels off, in contrast to the exponentially growing ERR 
for non-smokers, where there is no threshold. An anal¬ 
ogous mechanism is at work in the model B, which is 
sub-multiplicative throughout. 

By contrast, for model A, the risk in the scenario shown 
in Fig. I3bl is strikingly super-multiplicative except for 
ages t > 70a. At first glance, this deviation from mul¬ 
tiplicity might seem surprising: For constant rates, the 
hazard is typically proportional to pi(s), and the smok¬ 
ing dependence should thus drop out of the relative risk. 
However, smoking is assumed to start at age 18, only a 
few years prior to irradiation. Hence the baseline risk— 
initially proportional to the number of existing stage -2 
cells—mostly stems from those cells created before smok¬ 
ing started, and is thus comparable for noii-/smokers. 
Thus, the large ERR for smokers reflects the much higher 
excess risk due to freshly mutated stage-1 cells. It is only 


long after the start of exposure that the smokers’ baseline 
risk increases sufficiently to compensate for this. 


C. Two-pathway models 

We have tested a family of multi-path models of ge¬ 
nomic instability (GI). Let us briefly outline the key as¬ 
sumptions made here to reduce the number of parame¬ 
ters - in total, 6 mutation rates and 6 cell-division/death 
rates, as sketched in Fig. [TJ Most premises are moti¬ 
vated by the biological mechanisms thought to underlie 
GI [141 |43|. First, the destabilizing transition rates are 
set equal, aj = a, as they pertain to the inactivation of 
the same genomic-integrity gene. Since, presumably, GI 
per se does not lead to any growth advantage, we set the 
birth/death rates at stage 0 *^^ equal, a® = their 
precise value being marginal. Mutation rates following 
GI, are supposed to be larger, or at least equal, com¬ 
pared to their genomically stable (upper-branch) coun¬ 
terparts. 

Although GI may well be present as a sporadic mecha¬ 
nism, the baseline data are not thought to provide suffi¬ 
cient structure to distinguish complex background mod¬ 
els (see Discussion; cf. also Ref. HI)- We have thus 
focused on the case of radiation-induced GI - i.e., an ac¬ 
tivation of the otherwise silent GI path (u) by radiation. 
This may be modeled as cr = tgi x d (with vanishing 
background rate). In addition, radiation may affect reg¬ 
ular mutation or growth rates. 

None of the tested two-pathway models has led to any 
significant, numerically stable improvement beyond the 
benchmark two-stage model. It is stressed that relaxing 
the assumptions of linearity or vanishing background rate 
do not yield a significant improvement. 


IV. DISCUSSION 

A. Mechanism of radiation action 

A robust result of our analysis is an a-radiation- 
induced enhancement of proliferation rates of premalig- 
nant cells. This corroborates a consistent finding in many 
studies based on fitting two-stage models to lung-tumor 
data, both from epidemiological cohorts (Radon-exposed 
mine rs 1 ^ . amaiiai) and animal experiments (see 
Ref. for an overview). It also alleviates concerns that 
a proliferation effect might be confined to the two-stage 
model (^ . 

However, it has long been criticized that no ra¬ 
diobiological evidence exists for such a premalignant- 
growth-enhancing effect. Although experimental evi¬ 
dence is still sparse Haim, let us briefly discuss some 
theoretical models put forward to explain how radia¬ 
tion might lead to enhanced proliferation of premalignant 
cells. 


Table II: Observed numbers of lung-cancer cases by dose categories, compared with those predicted by the 
descriptive and multi-stage models (in brackets: excess cases). As a reference, we also give the person years (py). 


Doses (Gy) 

py 

Cases 

TSCE 

3SCE(A) 

3SCE(B) 

3SCE(C) 

descriptive 

0 -0.01 

188, 995 

204 

198 (0.5) 

203 (0.2) 

205 (0.2) 

204 (0.3) 

194 (0.6) 

0.01-0.03 

18,888 

32 

50 (3) 

49 (1) 

50 (1) 

50 (1) 

49 (4) 

0.03-0.1 

15, 843 

58 

51 (9) 

47 (4) 

47 (4) 

48 (4) 

52 (11) 

0.1 -0.3 

7, 091 

41 

37 (15) 

31 (8) 

30 (23) 

31 (7) 

39 (17) 

0.3-1 

3,153 

29 

33 (24) 

34 (25) 

31 (21) 

31 (21) 

32 (23) 

> 1 

738 

24 

18 (16) 

24 (21) 

25 (22) 

25 (22) 

22 (20) 

total 

234,708 

388 

388 (67) 

388 (59) 

388 (56) 

388 (56) 

388 (75) 


The most elaborate model is based on the following 
idea (1^: Radiation kills cells, which in turn triggers di¬ 
vision of neighboring stem cells so as to replenish the 
lost tissue. This may lead to net proliferation of prema- 
lignant cells if these have a selective advantage, that is, a 
slightly higher rate for cell division than needed for home¬ 
ostasis. (Strictly speaking, cells need not necessarily be 
killed; it might be sufficient for them to have a prolif¬ 
erative disadvantage—e.g., by effected cell-cycle arrest— 
such that these are subsequently repelled by premalig- 
nant cells.) That hypothesis has been shown to 

lead to a dose-rate response, 7 (d), which is qualitatively 
similar to that found in cohort studies (see, e.g.. Fig. la, 
albeit with quantitatively modest agreement. Saturation 
of the growth rate much higher than a characteristic dose 
rate, d !§> dg, occurs if more cells are killed than can be 
substituted for by premalignant cells. 

To obtain a very rough estimate for that critical dose 
rate, note that a-particle hits of a cell nucleus are in¬ 
dependent and rare. Thus the number of hits, N, is 
Poisson-distributed, Pn = e~^/Nl, which means the 
fraction of cells not hit is Pn=o = . Assu ming (i) a 

linear dose response, N = nD, with n ^ 4/Gy |^, and 
(ii) delivery of the dose D = dr over a characteristic 
time of order the interval between cell cycles (r ^ la 
for basal stem cells na), we have PN=o{d) = e At 
the characteristic dose rate, dg, about one out of, say, six 
nearest neighbors would be hit, such that Po{ds) = 1—p, 
p= 1 / 6 , yielding a characteristic dose rate of 

ds^ — ^ 0.04—. (3) 

nr a 

This is on the same order of magnitude as the value found 
in this study, d* ^ (0.03 — 0.06)Gy/a. In this light, the 
model results presented here and the repopulation mech¬ 
anism may be interpreted to be compatible. However, 
it is important bearing in mind that these estimates are 
naturally crude. Matters are further complicated by the 
spatially inhomogeneous energy deposition within differ¬ 
ent spots in the lungs, an effect not reflected in whole- 
lung doses used here [^ . 

As an alternative mechanism, a radiation-induced 
disturbance of cell communication has been suggested 
d, [ 5 ^. This may lead to, e.g., up-regulated growth sig¬ 


nals or a reduction of apoptosis |^, with a higher 
effect on intermediate cells because those tend to evade 
homeostatic control. It has even been proposed that a 
proliferation enhancement, mediated by such a bystander 
signaling, might be the generic mechanism for the re¬ 
sponse to densely ionizing radiation (^ . However, no 
mechanistic model has been put forward explaining in 
detail how this might lead to a dose-rate response, 7 (d). 

Even so, should the radiation response indeed be gov¬ 
erned by the bystander effect, then a similar behavior 
ought to be expected as for bystander-mediated muta¬ 
tion induction . In microbeam experiments (^ . , 

it has been found that for low doses—corresponding to 
less than ^ 10 % of cells being hit—the mutagenic yield 
was strongly amplified as bystander cells also received 
signals. (A similar pattern has been found for inter¬ 
cellular induction of apoptosis @|.) For much higher 
doses, in turn, the response essentially saturated. Along 
the lines leading to Eq. ©, we can estimate the char¬ 
acteristic dose rate dg by assuming the crossover to oc¬ 
cur at a fraction p = 0.1 of cells being hit. This yields 
ds ~ p/nr ~ 0.025Gy/a, under the same caveats as men¬ 
tioned above. From this standpoint, both bystander sig¬ 
naling and the repopulation hypothesis appear compati¬ 
ble with the dose-rate response found in this study. 


B. Comparison with previous studies 

As discussed earlier on, for mechanistic models, the 
risk is essentially determined by its structure, particu¬ 
larly, the radiation response. A dose-rate dependent pro¬ 
liferation rate, j{d), saturating for larger dose rates (as 
in Eq. U) is found in many studies applying the two-stage 
model to a-particle-induced lung cancer. In particular, 
this response quantitatively agrees with that of the pre¬ 
ferred two-stage model by Jacob et al. for the Mayak 
cohort, both for Plutonium and smoking [s^. Goncor- 
dantly, their risk estimates are similar to those of the 
present TSCE model - such as a cohort-averaged excess 
risk of ERR(t = Q0a,)/D ~ 4/Gy, a nonlinear dose de¬ 
pendence ERR(iJ) for larger doses, and sub-multiplicity 
of smoking and radiation risks. 

In a recent descriptive analysis of the Mayak data. 
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Gilbert et al. found a linear dose response, modified sig¬ 
nificantly only by a drop-off with attained age (113 , see 
also Appendix). The value at age 60, ERR(t = 60a) ~ 
7D/Gy, is somewhat higher than for the cohort average 
of the mechanistic models presented here. By contrast, 
the multi-stage models exhibit a strongly nonlinear dose 
dependence especially for higher doses. Furthermore, 
they typically display a decrease with attained age only 
for large enough ages, most pronounced after the end of 
exposure. Initially, an increase with age is seen due to 
exponential clonal growth, at least for high enough doses. 

In contrast to descriptive models, where an exposure 
modifier may not be significant when parametrized ex¬ 
plicitly, mechanistic models implicitly make predictions 
for the risk dependence on any exposure scenario. This is 
exemplified by the age-at-exposure dependence or the in¬ 
verse dose-rate effect shown by (at least some) mechanis¬ 
tic models (Fig. [5]). Furthermore, a non-significant trend 
in Ref. indicated a sub-multiplicative interaction be¬ 
tween Plutonium dose and smoking. This is in agreement 
with results from the 2-stage model, which further offers 
a mechanistic interpretation in terms of exponential cell 
growth, combined with earlier malignancies for smokers 
(see Results). 


C. Implications for lung carcinogenesis 

We have shown that several three-stage models give 
an improved description of the data compared to one in¬ 
volving two stages. Evidently, this stochastic inference 
is based solely on the lung-cancer endpoint and cannot 
replace experimental insight into the dynamics of inter¬ 
mediate stages. Even so, it is worth emphasizing that 
the Mayak data do not fully allow to distinguish general 
mechanisms for carcinogenesis. Rather, the results here 
mostly rely on the radiation-associated risk. In fact, the 
deviances of the various mechanistic and descriptive base¬ 
line models (risk factors age and smoking) do not differ 
noticeably - in contrast to the models including exposure 
(Table U). 

That statement seemingly contradicts the fact that 
there is only a fraction ^ 25% of excess cases relative 
to the baseline (as can be seen from ERR ~ 5D/Gy, 
the whole-cohort average dose being D ~ 0.05Gy). This 
translates to 60 — 70 excess cases 1 Table Hill, compared 
to about 320 baseline cases. However, truly spontaneous 
baseline cases (30 — 40) are outnumbered by smoking- 
related ones by a factor of ~ 10. Moreover, the smoking 
variable is only binary (and noisy). This makes it hard 
to resolve the actual baseline risk accurately, especially 
since the multi-stage models do not differ in their quali¬ 
tative behavior except for young ages. By contrast, the 
models do differ markedly for different irradiation sce¬ 
narios as found in the Mayak data. 


V. CONCLUSION 

We have shown that carcinogenesis models extended to 
three stages offer an improved description of the Mayak 
lung-cancer data, as compared to the two-stage model. 
All favored 3- and 2-stage models indicate a radiation- 
enhanced proliferation rate of premalignant cells, sug¬ 
gesting that this is a robust finding not limited to the 
framework of two stages. Despite that structural simi¬ 
larity, the models make qualitatively different predictions 
for the risk following certain exposure scenarios. For in¬ 
stance, those models whose radiation impact is on an 
earlier stage exhibit a strongly suppressed risk for older 
ages at exposure. As opposed to the two-stage case, 
all three-stage models reveal a soft-threshold dose(rate) 
above which the excess risk increases sharply. More¬ 
over, while an inverse dose-rate effect is predicted by all 
models, only those with three stages also display a pro¬ 
nounced direct effect for longer exposure durations. 

One aim of this study has been to elucidate which as¬ 
pects of carcinogenesis models are persistent themes or 
rather model-specific features. Such a better understand¬ 
ing should facilitate the development of improved car¬ 
cinogenesis models, both mechanistic but also descrip¬ 
tive ones. There is still some way to go toward a more 
realistic description. Ultimately, this would involve bi¬ 
ological input on rates or premalignant stages so as to 
cut the number of undetermined parameters. Gloser at 
hand, a natural next step may be validating the current 
models on other data sets. An extension to more than 
three stages conceivably provides a further improvement. 
Moreover, to get a more accurate description of the bi¬ 
ological mechanisms, it is desirable to develop models 
specifically for the different histological cancer subtypes. 
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Appendix A: Parameter estimates 

We now present some details on the model results. 

The reference descriptive model closely follows 
Ref. (^ . The main risk factor is smoking, which is in¬ 
cluded simply as a factor in Eq. ([1]), m 

10. The alcohol status further increases the baseline risk 
by g^Aaic _ g 0 . 6 ±o.i ^ g_ addition, the birth year was 
found to elevate the risk between years 1915 and 1935 
by about gO-^io i ~ 15 ^ With 3 extra parameters, this 
birth-cohort effect is moderately significant {AT? = —11). 
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It might be interpreted as a smoking modifier (as which 
it yields only a slightly higher deviance), possibly related 
to the changed smoking levels between the wars. The 
effect is irrelevant for the radiation risk, and generally 
no birth-cohort distinction is made in the scenarios in 
Figs. [SHU We mention that a calendar-year dependence 
has been tested but discarded: It oscillates strongly but 
shows no conclusive trend, nor does it influence the dose 
response. 

The dose dependence is modeled as a linear function, 
ERR = cD, c = (4.7 ± 0.9)Gy“^, with no evidence for 
threshold or quadratic terms. Note that the lag time 
entering the dose, D = D{t — bag), does not alter the 
deviance significantly between 0 — 10a. We fix it at 
bag = 5a, also for all multi-stage models. No signif¬ 
icant time-dependent dose modifiers are found. How¬ 
ever, a trend suggests a decrease of ERR/H with at¬ 
tained age, modeled as (t/60a)“^ '^^^’®, and a marginal 
decrease with median age at exposure. Even though the 
reference model © implies multiplicative risks of smok¬ 
ing and radiation, a non-significant trend indicates sub¬ 
multiplicity, with an ERR for smokers reduced by a factor 

g-0.9(±1.5) _ Q 4_ 

The maximum-likelihood parameters of the TSCE 
model are shown in Table HU We have omitted two co¬ 
variables not relevant for the radiation risk: alcohol and 
birth year. Both are included as addends to the growth 
rate 7 , consistent with an interpretation as smoking sur¬ 
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